Search CORE

696 research outputs found

On the Use of Perceptual Properties for Melody Estimation

Author: Liao Wei-Hsiang
Roebel Axel
Su Alvin. Wen-Yu
Yeh Chunghsin
Publication venue: HAL CCSD
Publication date: 01/01/2011
Field of study

cote interne IRCAM: Liao11aInternational audienceThis paper is about the use of perceptual principles for melody estimation. The melody stream is understood as generated by the most dominant source. Since the source with the strongest energy may not be perceptually the most dominant one, it is proposed to study the perceptual properties for melody estimation: loudness, masking effect and timbre similarity. The related criteria are integrated into a melody estimation system and their respective contributions are evaluated. The effectiveness of these perceptual criteria is confirmed by the evaluation results using more than one hundred excerpts of music recordings

Automatic Piano Transcription with Hierarchical Frequency-Time Transformer

Author: Akama Taketo
Ikemiya Yukara
Liao Wei-Hsiang
Mitsufuji Yuki
Takida Yuhta
Toyama Keisuke
Publication venue
Publication date: 09/07/2023
Field of study

Taking long-term spectral and temporal dependencies into account is essential for automatic piano transcription. This is especially helpful when determining the precise onset and offset for each note in the polyphonic piano content. In this case, we may rely on the capability of self-attention mechanism in Transformers to capture these long-term dependencies in the frequency and time axes. In this work, we propose hFT-Transformer, which is an automatic music transcription method that uses a two-level hierarchical frequency-time Transformer architecture. The first hierarchy includes a convolutional block in the time axis, a Transformer encoder in the frequency axis, and a Transformer decoder that converts the dimension in the frequency axis. The output is then fed into the second hierarchy which consists of another Transformer encoder in the time axis. We evaluated our method with the widely used MAPS and MAESTRO v3.0.0 datasets, and it demonstrated state-of-the-art performance on all the F1-scores of the metrics among Frame, Note, Note with Offset, and Note with Offset and Velocity estimations.Comment: 8 pages, 6 figures, to be published in ISMIR202

arXiv.org e-Print Archive

Music Mixing Style Transfer: A Contrastive Learning Approach to Disentangle Audio Effects

Author: Koo Junghyun
Lee Kyogu
Liao Wei-Hsiang
Martinez-Ramirez Marco A.
Mitsufuji Yuki
Uhlich Stefan
Publication venue
Publication date: 03/11/2022
Field of study

We propose an end-to-end music mixing style transfer system that converts the mixing style of an input multitrack to that of a reference song. This is achieved with an encoder pre-trained with a contrastive objective to extract only audio effects related information from a reference music recording. All our models are trained in a self-supervised manner from an already-processed wet multitrack dataset with an effective data preprocessing method that alleviates the data scarcity of obtaining unprocessed dry data. We analyze the proposed encoder for the disentanglement capability of audio effects and also validate its performance for mixing style transfer through both objective and subjective evaluations. From the results, we show the proposed system not only converts the mixing style of multitrack audio close to a reference but is also robust with mixture-wise style transfer upon using a music source separation model

arXiv.org e-Print Archive

VRDMG: Vocal Restoration via Diffusion Posterior Sampling with Multiple Guidance

Author: Hernandez-Olivan Carlos
Lai Chieh-Hsin
Liao Wei-Hsiang
Martínez-Ramirez Marco A.
Mitsufuji Yuki
Murata Naoki
Saito Koichi
Publication venue
Publication date: 13/09/2023
Field of study

Restoring degraded music signals is essential to enhance audio quality for downstream music manipulation. Recent diffusion-based music restoration methods have demonstrated impressive performance, and among them, diffusion posterior sampling (DPS) stands out given its intrinsic properties, making it versatile across various restoration tasks. In this paper, we identify that there are potential issues which will degrade current DPS-based methods' performance and introduce the way to mitigate the issues inspired by diverse diffusion guidance techniques including the RePaint (RP) strategy and the Pseudoinverse-Guided Diffusion Models (

\Pi

GDM). We demonstrate our methods for the vocal declipping and bandwidth extension tasks under various levels of distortion and cutoff frequency, respectively. In both tasks, our methods outperform the current DPS-based music restoration benchmarks. We refer to \url{http://carlosholivan.github.io/demos/audio-restoration-2023.html} for examples of the restored audio samples

arXiv.org e-Print Archive

Bacteremic pneumonia caused by Nocardia veterana in an HIV-infected patient

Author: Hsiao Cheng-Hsiang
Hsueh Po-Ren
Huang Yu-Tsung
Hung Chien-Ching
Lai Chih-Cheng
Liao Chun-Hsing
Liu Wei-Lun
Publication venue: International Society for Infectious Diseases. Published by Elsevier Ltd.
Publication date: 30/06/2011
Field of study

SummaryDisseminated Nocardia veterana infection has rarely been reported. We describe the first reported case of N. veterana bacteremic pneumonia in an HIV-infected patient. The isolate was confirmed by 16S rRNA sequencing analysis. The patient initially responded well to trimethoprim–sulfamethoxazole treatment (minimum inhibitory concentration 0.25μg/ml), but died of ventilator-associated pneumonia

Elsevier - Publisher Connector

Kinematic Analyses of a Parallel-type Independently Controllable Transmission

Author: Der-Min Tsay
Guan-Shyong Hwang
Jao-Hwa Kuang
Tzuen-Lih Chern
Wei-Hsiang Liao
Publication venue: 'International Journal of Automation and Smart Technology'
Publication date: 01/09/2011
Field of study

This study proposes a novel design of a parallel-type Independently Controllable Transmission (ICT). The parallel-type ICT can produce a continuously variable transmission ratio and a required angular output velocity that can be independently manipulated by a controller yet not affected by the angular velocity of the input shaft. The proposed parallel-type ICT is composed of two planetary gear trains and two transmission-connecting members. A prototype was built to investigate its kinematic characteristics and verify application feasibility

Directory of Open Access Journals

Ample Pairs

Author: Chang-Jen Huang (176655)
Che-Yi Lin (218292)
Chung-Hsiang Yang (211530)
Hsin-Chi Liao (4254013)
Huang-Wei Lien (176642)
Ming-Yuan Tsai (4254010)
Sheng-Ping Hwang (4254007)
Yi-Chung Chen (171688)
Yu-Fen Lu (171691)
Yu-Hsiu Liu (4254004)
Yun-Ren Lai (4254001)
Publication venue
Publication date: 01/07/2017
Field of study

We show that the ample degree of a stable theory with trivial forking is preserved when we consider the corresponding theory of belles paires, if it exists. This result also applies to the theory of

H

-structures of a trivial theory of rank

1

.Comment: Research partially supported by the program MTM2014-59178-P. The second author conducted research with support of the programme ANR-13-BS01-0006 Valcomo. The third author would like to thank the European Research Council grant 33882

arXiv.org e-Print Archive

Directory of Open Access Journals

FigShare